Mon. Not. R. Astron. Soc. OOP. [TlfTSl C2007') Printed 1 February 2008 (MN I*TeX style file v2.2) 



Testing the Accuracy of Synthetic Stellar Libraries 



Lucimara P. Martins^*, Paula Coelho^'^ 



^ Universidade de Sao Paulo, I AG, Rua do Matao 1226, Sao Paulo 05508-900, Brazil; 

^ Max- Planck- Institut fiir Astrophysik, Karl-Schwarzschild-Strasse 1, 85740 Garching bei Miinchen, Germany 



ABSTRACT 

One of the main ingredients of stellar population synthesis models is a library 
of stellar spectra. Both empirical and theoretical libraries are used for this purpose, 
and the question about which one is preferable is still debated in the literature. Em- 
pirical and theoretical libraries are being improved significantly over the years, and 
many libraries have become available lately. However, it is not clear in the literature 
what are the advantages of using each of these new libraries, and how far behind are 
models compared to observations. Here we compare in detail some of the major theo- 
retical libraries available in the literature with observations, aiming at detecting weak- 
nesses and strengths from the stellar population modelling point of view. Our test is 
twofold: we compared model predictions and observations for broad-band colours and 
for high resolution spectral features. Concerning the broad-band colours, we measured 
the st ellar colour given by three recent sets of model atmospheres and flux dis tribu- 
tions (jCasteUi fc Kuruczll2003HGustafsson et al.ll2003tlBrott fc Hauschildl^'2005'). and 



compared them with a recent UBVRIJHK calibration (jWort hev & Lec 2007) which 
is mostly based on empirical data. We found that the models can reproduce with 
reasonable accuracy the stellar colours for a fair interval in effective temperatures 
and gravities. The exceptions are: 1) the U-B colour, where the models are typi- 
cally redder t han the observations, and; 2) the very cool stars in general (V-K > 3). 
ICastelh fc Ku r ucz (2003) is the set of models that best re produce the bluest colours 
(U-B, B-V) while iGustafsson et al.l (|2003f ) and lBrott fc HauschiTdtl (|2005t) more accu- 
rately predict the visual colours. The three sets of models perform in a similar way 
for the infrared colours. Concerning the high resolution spectral features, we mea- 
sured 35 sp ectral indi ces defined i n the lite rature o n three h igh resolution synthetic 
libraries (.Coelho et al. 2005: Mar tins et al.| [2005: Munari et a l. 2005), a nd compared 
thern with the observed measurements given by three empirica l libraries (jValdes et al.l 
12004 ISanchez-Blazauez et al.ll20Q6HPrugniel fc Soubiranll2001[ ). The measured indices 
cover the wavelength range from ~ 3500 to ~ 8700 A. We found that the direct com- 
parison between models and observations is not a simple task, given the uncertainties 
in parameter determinations of empirical libraries. Taking that aside, we found that 
in general the three libraries present simil ar behaviours and s ystematic deviations. 
For stars with Toff < 7000K, the library bv lCoelho et all (|2005^ is the one with best 
average performance. We detect that lists of atomic and molecular line opacities still 
need improvement, specially in the blue region of the spectrum, and for the cool stars 
(Tefi < 4500K). 
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1 INTRODUCTION 

Evolutionary population synthesis models describe the 
spectral evolution of stellar systems, and are fundamental 
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tools in the analysis of bot h near by and distant galax- 
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formation history and chemical enrichment in a variety of 
systems, from early type galaxies and spiral bulges to star 
forming galaxies at different redshifts. 

Libraries of stellar spectra are one of the main ingre- 
dients of stellar population models, and both empirical and 
theoretical libraries have improved dramatically in recent 
years, allowing the construction of more detailed models. 
Observations are also becoming increasingly better and de- 
manding more from the modelling point of view. 

Recently, many new empirical libraries suitable 
to stellar population synthesis have been made avail- 
able with improved spectral resolution and param- 
eter coverage: e.g. STE LIB (|Le Borgne et all l2003l) 
UVE S POP llJehin et al.l |2005^■ Indo - US llValdes et al.1 
I2OO4I). Elodie (iPrugniel fc SoubiranI l200l[). MILES^ 
dSanchez-Blazauez et al.1 I2OO6I '). and NGSL (jCregg et aD 
I2OO4I ). 

The choice of using either an empirical or a synthetic 
library in stellar population models is a subject of debate. 
Many aspects are important when considering a library for 
stellar population synthesis, and parameter coverage is one 
of the main issues. A good parameter coverage is not triv- 
ial for empirical libraries, which are limited to whatever is 
possible to obtain given observational constraints (resolu- 
tion, wavelength coverage, exposure time, etc.). They have 
to cover not only a good wavelength range (which limits the 
spectral resolution), but also cover from cool to hot stars, 
dwarfs and giants, and different chemical abundances. 

Amongst the synthetic libraries, perhaps the most 
widely u sed i s the flux distribution predicted by the 
iKurucd HlM) model atmospheres. The BaS eL library 
i Leieune et al l Il997l . Il998l : IWestera et al] |2002| ) extended 
these flux distributions including s pectra of M stars 



computed with model atmosp heres by Fluks et al. (1991 , 
[BcsscU ct al. (1989, 199^ and I AUard fcH auschildt (1991). 
However the spectral resolution of the BaSeL library is 
limited to ~ 20 A, which is by far lower than the modern ob- 
served spectra of both individual stars and integrated stellar 
populations. Resolution ceased to be a limitation recently, 
with many high-resolution theoretical libraries appear- 
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updated line lists, state of the art model atmospheres and 
spectral synthesis codes, and a very extensive parameter 
coverage. A qualitative comparison of some of th e recent 
high resolution synthetic libraries is given by iBertonel 
l|2006t) . 

The major concern when using synthetic libraries for 
high resolution stellar population models is to know whether 
a synthetic library can safely replace an empirical one. 
These libraries are based on model atmospheres and there- 
fore are limited to the approximations adopted in the com- 
putations. Ideally, one would like to generate models that 
accounts for all the effects taking place across the HR 
diagram: non- Local Thermodynamic Equilibrium (NLTE), 
line-blanketing, sphericity, expansion, non-radiative heating, 
convection, etc. Such an approach is unfeasible at present 



time, even if the astrophysical models were available. What 
is usually done is to take some of these effects into account 
where they matter the most. The hardest stars to reproduce 
in this sense are the very hot and very cool stars, where 
extreme scenarios take place (e.g. non-LTE effects for very 
hot stars, and sphericity for cool giants). Additionally, com- 
puting reliable high-resolution synthetic spectra is a very 
challenging task, since it requires building an extensive and 
accurate list of atomic and molecular line opacities. 

Nevertheless, synthetic libraries overcome limitations of 
empirical libraries, for instance their inability to cover the 
whole space in atmospheric parameters, and in particular 
abundance patterns that differ from that of the observed 
stars (mainly from the solar neighbourhood, and in some few 
cases from the Magellanic Clouds). Therefore, population 
models based solely on empirical libraries cannot reproduce 
the integrated spectra of systems that have undergone star 
formation histories different than the Solar Neighbourhood. 

With so many different choices for the stellar library, 
the stellar population modeller might feel lost about which 
library should be used. It is certain that each of these li- 
braries have its own strengths and weaknesses, but identi- 
fying them is not always trivial. We propose in this work 
to make a detailed comparison between some of the major 
synthetic stellar libraries available, comparing them against 
empirical libraries. 

This paper is organised as follows; in §2 we present an 
overview of theoretical libraries. In §3 the model predic- 
tions of three sets of model a t mospheres teastel li fc Kurxic3 
I2OO3I : IGustafsson et al.ll2003l : iBrott fc H auschildtl l2005l ) for 
broad-band colours are compared to the em pirical UB- 
VRIJHK relation from IWorthev fc Le3 l|200'it l. In §4 we 
compare model spectral indices predicted by three recent 
high-resol ution libraries ijCoelho et al.|[2005l ; iMartins et al.l 
[2005; Mu nari et al.ll2005l') to indices measured m the empiri- 
cal libr aries by Valdes et al.l (12004 ) ; ISanchez-Blazquez et al.l 
( 200d ) ; iPrugniel fc SoubiranI l|200ll ) . 

For the purpose of the present work, we focus our com- 
parisons on the solar metallicity regime, where the complete- 
ness of the empirical libraries is higher, as well as the accu- 
racy of the stellar atmospheric parameters. Our conclusions 
and discussions are presented in §5. 



2 OVERVIEW OF THE THEORETICAL 
LIBRARIES 

The nomenclature used by atmosphere and synthetic spectra 
modellers are sometimes confusing for the stellar population 
models users. 

By model atmosphere we mean the run of temperature, 
gas, electron and radiation pressure, convective velocity and 
flux, and more generally, of all relevant quantities as a func- 
tion of some depth variable (geometrical, or optical depth 
at some special frequency, or column mass). The flux dis- 
tribution or synthetic spectra is the emergent flux predicted 
by a model atmosphere, and is required for comparison with 
observations. 

It is convenient from the computational point of view 
to split the calculation of a synthetic spectra in two major 
steps; the calculation of the model atmosphere, commonly 
adopting Opacity Distribution Function technique (ODF, 
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IStrom fc Kurucz|[l96^ ') — and the calculation of the emer- 
gent flux with a spectral synthesis code. 

Alternatively, model atmosphere codes that use an 
Opacity Sam pling (OS) method to ac count for the line ab- 
sorption (e.g. Ijohnson fc KrupplllQTGl ) can directly produce 
as output a well sampled flux distribution. The OS technique 
is more time consuming from the computational point of 
view then the ODF technique, but allows for a much larger 
flexibility in modelling. For example, peculiar chemical com- 
positions can be easily consideredthat'. 

The majority of model atmospheres available are ID 
and hydrostatic, assume LTE and treat convection with the 
mixing length theory. The r nixing le n gth t heory was intro- 
duced in ATLAS6 code bv iKurucj (Il979l ). and is a phe- 
nomenological approach to convection in which it is assumed 
that the convective energy is transported by eddy "bubbles" 
of just one size, t requires an adjustable parameter qj/l, 
which represents the ratio between the characteristic length 
(distance travelled by an element of fluid before its disso- 
lution) and the scale height of the local pressure (Ifp). The 
parameter qa/l has to be set at different values to f it dif- 
ferent types of observations (jSteffen fc Ludwi3ll999l ). and 
no single value works well in all classes. An alternative con- 
vective model is Ful l Spect rum Turbulence, introduced by 
ICanuto fc Mazzitellil l|l99ll ) and adopted, for example, by 
NeMo grid of atmospheres l|Heiter et al.l l2002'). 

Throughout this paper we further distinguish a flux 
distribution from a synthetic spectrum. The flux distribu- 
tion is the spectral energy distribution predicted directly by 
a model atmosphere, and is commonly available together 
with the model atmospheres. T his is the case, for exam - 
ple, of the synthetic libra ries bv Castelli fc Kurucj IpOOSi ). 
iBrott fc Hauschildtl l|2005l) and lGustafsson et alj I (|2003l ). 

By synthetic spectrum we mean the flux calculated by 
a line profile synthesis code, using as input a model atmo- 
sphere and a refined atomic and molecular line list, that can 
be at some extend different from the line list adopted in the 
model atmosphere computation. It can also adopt different 
chemical compositions than the model atmosphere in order 
to account for small variations in the abundance pattern 
(as long as the difference is not enough to produce impor- 
tant changes in the structure of the atmosphere). This is 
the method commonly used in high resolution stellar spec- 
troscopy studies, and it is the case of the libraries from 
ICoelho et all (|2005l ). lMartins et al.l(|2005l ) and lMunari et all 
1 20051 ). A synthetic spectrum is usually computed at a higher 
resolution than a model atmosphere flux distribution, given 
that it aims at resolving individual line profiles. 

Additionally, a theoretical library that is intended to 
produce accurate high resolution line profiles is not gener- 
ally a library that also predicts good spectrophotometry. 
That happens because usually only the lower lying energy 
levels of atoms have been determined in laboratory. If only 
those transitions are taken into account in a model atmo- 
sphere, the line blanketing would be severely incomplete. To 
avoid this deficiency and to improve both the temperature 
structure of the model atmospheres and the spectrophoto- 
metric flux distributions, the computation requires account- 
ing for lines where one or both energy levels have to be 
predicted from quantum mechanical c alculations. Th ese so- 
called "predicted lines" (hereafter PLs, iKuruc 3 1 19921 ) are an 
essential contribution to the total line blanketing in model 



atmospheres and flux distribution computations. But as the 
theoretical predictions are accurate to only a few percent, 
wavelengths and computed intensities for these lines may 
be largely uncertain. As a consequence the PLs may not 
correspo nd in position and intensity to the observabl e coun- 
terparts iBell et al.|[l99i : ICasteUi fc Kurucj l2004al ). "pol- 
luting" the high resolution synthetic spectrum. Therefore, 
synthetic libraries that are aimed at high resolution studies 
do not include the PLs, and thus they provide less accurate 
spectrophotometric predictions when compared to the flux 
distributions libraries. 

For this reason we divided the comparisons of the 
present paper in two different sections. Section 3 studies 
the flux distributions given by some model atmosphere grids 
in order to assess the ability of those models in predicting 
broad-band colours. In Section 4 we change our focus to 
libraries that aim at high resolution studies, testing their 
ability to reproduce higher resolution spectral features. The 
grids evaluated in the present work are briefly described be- 
low. 



2.1 Model atmosphere flux distributions 

Amongst several model atmos p here grids available in 
literature (e.g. iKuruczl Il993l: iHau schildt e t all Il996l: 



Pauldrac h et all 1200 ll : iHeiter et all 12002: Lan z fc Hubenvl 
fioOSbl ). we selected three grids that cover a large pa- 
rameter space in effe ctive temperatures T^s a nd super- 
ficial grav ities log q: Castelli fc Kurucj (|2003l . hereafter 



ATLAS9), 'Gustafsson et al' ('2003', hereafter MARCS) and 
iBrott fc Ha uschildt (2005. hereafter PHOENIX). 

Based on lKurucj (|i99^ ) codes, the ATLAS9 model at- 
mospheres follow the classical approximations of steady- 
state, homogeneous, LTE, plane-parallel layers that extend 
vertically through the regio n where the lines are formed. In 
its more recent version (Castelli fc Kurxic3l2003l !n. oml is 
assumed to be 1.25 to fit the energy distribution from the 
centre of the Sun. All models are computed with the con- 
vection option switched on and with the overshooting option 
switched off. The convective flux decreases with increasing 
Tcft and it naturally disappears for Tcft ~ 9000K. The mod- 
els are available in the range 3500K ^ TcH ^5 50000K. 

Plane-parallel LTE models will fail wherever sphericity 
(specially important for giant stars) and non-LTE effects (for 
very hot stars) are evident. Two models that take sphericity 
into account are PHOENIX and MARC S. 

PHOENIX (|Hauschildt et al.lll996h is a multi-purpose 
stellar model atmosphere code for plane-parallel and spher- 
ical models. The original versions of PHOENIX were de- 
veloped for the modell ing of novae and supernovae ejecta 
(|Hauschildt et al.|[l999l . and references therein). The most 
recent grid is presented in lBrott fc Hauschildtl (l2005l fl. The 
equilibrium of Phoenix is solved simultaneously for 40 ele- 
ments, with usually two to six ionisation stages per element 
and 600 relevant molecular species for oxygen-rich ideal gas 
compositions. The chemistry has been gradually updated 
with additional molecular species since the original code. 
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The convective mixing is treated according to the mixing- 
length theory, assuming qail = 2.0. Both atomic and molec- 
ular lines are treated with direct opacity sampling method. 
PHOENIX models cover the range 2000K T^ff ^ lOOOOK. 

MARCS models hav e undergone several imp rovements 
since the original code bv lGustafsson et al.l l|l975h . the most 
important ones being the replacement of the ODF technique 
by OS technique, the possibility to use a spherically sym- 
metric geometry for extended objects, and major im prove- 
ments of the line and continuous opacities l|Plej|l99^ ). The 
common assumptions of spherical or plane-parallel stratifi- 
cation in homogeneous stationary layers, hydrostatic equi- 
librium and LTE are made. Energy conservation is required 
for radiative and convective flux, where the energy trans- 
port due to conv ection is treated thro ugh the local mixing- 
length theory bv lHenvev et al.| (Il965l ). The mixing- length I 
is chosen as 1.5Hp, which is a reasonable quantity to sim- 
ulate the temperature stru cture beneath the photosphere 
jNordlund fc Dravins|[l990l ). The most recent version of the 
MARCS grids is presented in lCustafsson et all (|2003l fl. The 
models cover 4000 ^ T^s ^ 8000K and adopt plane-parallel 
geometry for the dwarfs (log g ^ 3.0) and spherical geome- 
try for the giants (log g ^ 3.5; both geometries are available 
for log g values of 3.0 and 3.5). 

The three sets of models adopt a micro-turbulent veloc- 
ity of 2 km s~^ and are computed for 1 solar mass. 

2.2 High resolution synthetic spectral libraries 

Amongst the higher resolution synthetic libraries, we se- 
lected three of the most recent ones which are publicly avail- 
able, each of them with an outstanding imp rovement com- 
pared to previous ones. iMunari et al.l (|2005l . hereafter Mu- 
narij^ has an impre ssive coverage of the HR diagram. Their 
models are based on lKuruc3 (Il993l ) codes and ATLAS9 grid, 
covermg 2500 - 10500 A in wavelength range at a maximum 
resolution of R=20000. They range from 3500 K to 47500 K 
in Teff , with log g varying between 0.0 and 5.0 dex, for differ- 
ent values of metallicity, Q-enhancement, rotational velocity 
and micro-turbulen t velocity. 

The library bv lCoelho et al. I (|2005l . hereafter Coelho'j^, 
also based on ATLAS9 model atmospheres, had a special 
care for low temperature stars, employing a detailed and 
calibrated line list that has been improved along the years 
(see the original paper for a list of references) . Their models 
cover from 3000 A to 1.8 \im spanning from 3500 K to 7000 
K, with log g varying between 0.0 and 5.0 dex, also covering 
diff erent metallicit i es and Q-enhancement. 

iMartins et al.l (|2005l . hereafter Marim^jf] searched the 
literature for the best available codes for each range of tem- 
peratures and us ed them to build the m o dels. They used 
iHubenvl (|l988l ). iHubenv fc L"an3 (|l995l ). iLanz fc Hubeiivl 
l|2003al ) model atmospheres considering non-LTE for hot 
stars, ATLAS9 models for intermediate temperature stars 
and PHOENIX line-blanketed models for very cool stars. 



^ http://marcs.astro.uu.se/ 

" http://archives.pd.astro.it/2500-10500/ 

5 http://www.mpa-garching.mpg.de/PUBLICATIONS/ 

DATA/SYNTHSTELLIB/synthetic_steIlar _spectra.html 
^ http:/ /www. astro. iag.usp.br/~lucimara/library.htm 



The library covers from 3000 to 7000 A, with temperatures 
ranging from 3000K to 55000K and log g from -0.5 to 5.5 
dex, for 4 different metallicities (but no a-enhancement). 



3 EVALUATING THE FLUX DISTRIBUTIONS: 
BROAD BAND COLOURS 

A convenient way of comparing the flux distributions given 
by the model grids with observations is through broad-band 
colours, which are likely to be the flrst observables expected 
to be predicted by spectral stellar population models. 

In order to do this comparison, we selected pairs of 
Teff and log g that are representative of an isochrone of 
a young and an old population (10 Myrs and 10 Gyrs). 
The pairs were selected to uniformly cover TeH, respecting 
the spacing of each set of models (ATLAS9 and MARCS 
have steps of 250K, and PHOENIX ha s steps of 20qK). Th e 
isochrones adopted are the ones by iGirardi et al] (|2002l ). 
for solar metallicity composition. The transformation to ob- 
served colours we re done adoptin g the U BVRIJHK empiri- 
cal calibration bv lWorthev fc Led l|2007l . hereafter WL07)[!|. 
In that work, the authors used stars with measured photom- 
etry and known metallicity [Fe/H] to generate colour-colour 
relations that include the abundance dependence. They fur- 
ther added colour-temperature relations until the whole pa- 
rameter range was covered, taking medians in regions where 
more than one relation applied. The colour— T^ff relations 
were obtained from several sources in literature, mainly from 
empirical work, but also from theoretical work. At both ends 
of the Teff range, the relations were taken purely from empir- 
ical sources; in the middle range, the theoretical relations by 
VandenBerg fc Clem (2003) for V-I were added, and behaved 
well compared to empirical ones. Any other theoretical re- 
lation employed was used with a lesser weight (G. Worthey, 
priv. comm. See also Figures 7 and 8 in WL07). Therefore, 
we expect the relations by WL07 to be a close match to 
observations, and that the theoretical relations, which could 
bias our comparisons, do not have an important weight. 

The magnitudes predicted by ATLAS9, MARCS and 
PHOENIX grids were measured using the IRAF task sbands, 
adopting the filter transmission curves of the photomet- 
ric systems adopted in WL07. Zero point corrections were 
ap plied to the mode l magnitudes using the Vega model 
bv ICasteUi fc Kurucj (|i994fl and adopting Vega magni- 
tudes : ^Johnson ~ 0.02, ^Johnson ~ 0.03, '^Johnson 

' ' Rcousin = 0.039, Icousin ~ 0.035, JflesseH = 0.02, 
Hsesseii = 0.02, KBessell ~ 0.02. 

The comparison between the empirical relation and the 
model predictions are given in Figures [T] and [2] for the 10 
Myrs and 10 Gyrs isochrones respectively. The empirical 
relation is presented as black circles. ATLAS9 predictions 
are given in red diamonds, blue squares are predictions for 
MARCS models, and green triangles for PHOENIX. Filled 
and open symbols represent dwarfs (log g ^ 3.0) and giant 
stars (log g < 3.0), respectively. 

The results are presented in colour-colour relations 
where on the x axis is shown the (V-K) colour, which is a 

^ Colour-temperature table and interpolation program available 
at http://astro.wsu.edu/models/ 

* Available at |http : / / wwwuser . oat Its, astro . it / castelli / vega. html | 
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good tracer of Tcff (higher values of Tcs correspond to lower 
values of V-K). The six panels in each figure show difi'erent 
colours in the y axis. The residuals (model minus empiri- 
cal) between the model colours and the WL07 calibration 
for each Tcff, log g pair is shown below each colour-colour 
panel, where the error bars indicate the uncertainties of the 
WL07 cahbration. 

For stars 4000 K sC T^ff ^ 8000 K, which is the interval 
that is common to all sets of models, we present in Tables 
[1] and [2] the average absolute differences between model and 
empirical relations, for the 10 Gyrs and 10 Myrs populations 
respectively. 

It can be seen from Figures [1] and [2] that the three set 
of models show a similar behaviour among themselves for 
a large range in Tcfr (V-K), and are a good reproduction 
of the empirical relation for the colours V-I, V-R and J-K. 
The residuals are larger for cooler stars (V-K > 3), for all 
colours. There is a tendency in all models to under-predict 
the B-V and H-K colours. The colour where the models dif- 
fer more strongly is U-B: in the case of Figure [2] (10 Gyrs 
isochrone), we note that in the range 1 < V-K < 3 (which 
mainly represents the turn-off stars) ATLAS9 models re- 
produce considerably better the observations than either 
PHOENIX or MARCS. The situation is more complex for 
the same colour in the young population (Figure [ij and all 
residuals are larger, specially for the giants. In the case of 
the dwarfs, ATLAS9 is still the set of models that best re- 
produces the empirical relation. The differences are typically 
smaller for the visual colours, and for V-I and V-R colours 
ATLAS9 presents on average higher residuals than MARCS 
or PHOENIX, likely due to the different implementations 
of molecular opacities. For the near-infrared colours, the be- 
haviour is quite similar for the three sets of models. 

The reason for the large difference in the U-B colour 
is unclear to the present authors. Differences in the im- 
plementation of both line blanketing and continuum opac- 
ities, and also differences in calibration of the convection 
treatment might be playing a role. The effect of both line 
blanketing and continuum opacities in the near-UV and 
UV fluxes is a long standing (and sometimes confusing) 
problem. Each set of models has its particular implemen - 
tation, and we refer the reader to 'Houda shelt et all ( 



Table 1. Mean absolute residuals for the broad- band colours. 
These values were obtained for the 10 Myrs isochrone and for the 
interval 4000K ^ T<,ff^ 8000K. 
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Garcfa-Gil et al.l (|2005) and references therein for appropri- 
ate discussions on the subject. The effect of the convection 
treatment o n broad- band colou rs have been discussed, for 
example, in iHeiter et al.l (|2002l ). and indeed we note that 
the three sets of models present diff erent values of the mix - 
ing length parameter oml- However, iKucinskas et al.l (|2005l ) 
have shown that the efi'ect of different uml is not significant, 
and important effects appear only when more realistic 3D 
computations take place. Nevertheless, they focused their 
analysis in late- type giants, and therefore it remains an open 
question if different om l could explain the differences we see 
here for the parameters typical of turn-ofi' stars. 

Concerning the cooler stars, usually around V-K ~ 3 
(Toff ~ 4250K) the models start to deviate from the empir- 
ical relation. It is interesting to see that the model predic- 
tions are not strikingly different among the sets of models 
analysed here (at least for Tefj ^ 3500K), even though AT- 
LAS9 models are computed in plane-parallel geometry and 



Colour 


ATLAS9 


MARCS 


PHOENIX 


Mean error 


U-B 


0.370 


0.695 


0.611 


0.073 


B-V 


0.070 


0.145 


0.066 


0.020 


V-I 


0.041 


0.029 


0.010 


0.015 


V-R 


0.022 


0.045 


0.026 


0.012 


J-K 


0.049 


0.056 


0.079 


0.013 


H-K 


0.018 


0.016 


0.019 


0.004 



Table 2. Mean absolute residuals for the broad-band colours. 
These values were obtained for the 10 Gyrs isochrone and for the 
interval 4000K ^ T^ff ^ 8000K. 



Colour 


ATLAS 


MARCS 


PHOENIX 


Mean error 


U-B 


0.105 


0.440 


0.309 


0.073 


B-V 


0.146 


0.235 


0.126 


0.020 


V-I 


0.048 


0.015 


0.009 


0.015 


V-R 


0.038 


0.017 


0.016 


0.012 


J-K 


0.023 


0.027 


0.034 


0.013 


H-K 


0.024 


0.022 


0.018 


0.004 



PHOENIX models in sph erical geometry (MARCS models 
present both geometries). IKucinskas et al.l 12005) present a 
very detailed analysis of broad-band colours for late-type 
giants, and test the effect of several model parameters on 
the broad-band colours predictions (namely molecular opac- 
ities, micro-turbulent velocities, stellar mass and treatment 
of convection). Those authors note that it is possible that 
spherical models may not be sufficient, and additional effects 
as convection, variability and mass loss, become increasingly 
important for cooler stars. 



4 EVALUATING THE HIGH RESOLUTION 
FEATURES: SPECTRAL INDICES 

A convenient way to evaluate the theoretical spectra is to 
measure widely used spectral indices and compare them with 
the observed values. This approach will not evaluate the 
quality of the model spectrum at its full wavelength cover- 
age, but allows a presentation of the results in a scale that 
is familiar to the user of stellar population models. 

We compared Coelho, Martins and Munari libraries 
with three of the most complete empirical libraries avail- 
able; Indo-US, MILES and Elodie. 



4.1 Overview of the Empirical Libraries 

The first empirical stellar librar y that provi ded fiux cali- 
brated spectra was presented in I Jone j (|l998l ). With mod- 
erately h i gh re solution (1.8 A), this library was used by 
IVazdeki^ l|l999l ) to produce for the first time spectral stel- 
lar population models at high resolution. However, Jones 
library is limited to two narrow wavelength regions (3820- 
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Figure 1. Comparison between the colours predi cted by syntlieti c flux distributions and an empirical colour-temperature relation, for 
stars representing a 10 Myrs isochrone from Girardi et alj l l2002l) . Red diamonds correspond to ATLAS9 models, green triangles to 
PHOENIX models and the blue squares to MARCS models. Filled and open symb ols represent dwarfs (lo g g ^ 3.0) and giant stars (log 
g < 3.0), respectively. Circles are the values expected from the empirical relation o f lWorthev fc Led ||2007| '). On the bottom of each colour 
plot it is shown the residuals (difference between the models and the values obtained through the empirical calibration). The thin black 
vertical lines in this plot represent the error bars of the empirical calibration. 



4500 A and 4780-5460 A), and it's sparse in dwarfs hotter 
than about 70 00 K and metal-poor g iants. 

STELI^ l|Le Borgne et al.ll2003l ) represents a substan- 
tial improvement over previous libraries. It consists of 249 
stellar spectra in the range of 3200 A to 9500 A, with an 
spectral resolution of abo ut 3A (R=2000). This is t he base 
library for the widely used lBruzual fc Chariot! ()2003l l stellar 
population models. 

Following this work, IValdes et"all (|2004l ) pubUshed 
Indo-US 0, a library with resolution down to FWHM ^ 
lA and a good coverage of the colour magnitude diagram. 
Indo-US has a much higher number of stars (1273), with 
spectra ranging from 3460 A to 9464 A. They cover a fair 
range in atmospheric parameters. The main concern on this 
library regards its spectrophotometry, which was obtained 
by fitting each observation to a standard spectral energy 
distribution w ith a close ma tch in spectral type, using the 
com pilation of 'Picklcs' ('1998'). 

(Prugniel fc Soubiran. (,2001, ) pubhshed the ELODIE Li- 



brar£3, which has been updated since then. In its current 
version (Elodie.3) there are 1388 starts, in the wavelength 
range 4000 to 6800 A. Although it has a more limited wave- 
length coverage with respect to the others, it has a very high 
spectral resolution (R=10000 for flux calibrated spectra and 
R=42000 for flux normalised to the pseudo-continuum) . But 
the flux calibration of this library might be compromised by 
the use of an echelle spectrograph. 

Anothe r library that became availa b le recently is 
MILE^3 (^Sanchcz-Bla zguez et al] l2006l : ICenarro et ai] 
i2007l ). The spectra ranges from 3525 A to 7500 A, at a 2.3 A 
(FWHM) resolution. This library, with 985 stars, was care- 
fully created trying to fill the major gaps that existed in 
other empirical libraries. 

The Next Generation Stellar Library (NGST, Gregg et 
al. 2004) is yet another library soon to be publicly avail- 
able, which is an UV/optical (from 1660 to 10200 A) stellar 
spectral atlas using STIS-HST (PID 9786). The advantage 
of this library is that, being obtained with STIS at Hub- 



^ http:/ /www. ast.obs-mip.fr/users/leborgne/stelib/index. html 
http://www.noao.edu/cflib 



http: / /www. obs.u-bordeauxl.fr/m2a/soubiran/clodie_library.html 
http: / / www.ucm.es/info / Astrof/ miles / miles.html 
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Figure 2. The same as Figure[T] for stars representing a population of 10 Gyrs. 



ble Space Telescope, it presents an unprecedented internally 
consistent flux calibration across all wavelengths. 

Figure |3] shows the coverage in temperature and grav- 
ity of four empirical libraries (STELIB, Ind o-US, MILES 
and E LODIE), overplotted on isochrones from lGirardi et aP 
l|2002tl for ages 10 Myrs, 100 Myrs, 1 Gyrs and 10 Gyrs. All 
isochrones are for solar metallicity, which is the regime where 
the empirical libraries are more complete. The stars plotted 
are the ones with metallicity in the range -0.1 ^ [Fe/H] 5S 
0.1. It is clear that libraries have been improving in terms of 
parameter coverage, but this is a hard task and some holes 
still exist. Hot stars are missing in most of the libraries, be- 
ing really sparse towards O and B stars. Libraries tend to 
have the majority of stars for temperatures between 5000 K 
and 7000 K and there is a sudden drop for lower tempera- 
tures, specially below 4000 K. MILES has the best coverage 
for lower temperatures, while ELODIE is the most complete 
in the high temperature end. STELIB has only one O star, 
and only one dwarf below 4000K. Indo-US has no stars with 
Tcff > 27000K, and no dwarf below 4000K. 
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4.2 Spectral indices comparison 

A total of thirty-five spectral indices were measured in the 
spectra of the three high resolution synthetic libraries to be 
studied {Coelho, Martins and Munari), and on three of the 
empirical libraries (Indo-US, ELODIE and MILES). We se- 



Figure 3. Distribution of stars with solar metallicity in four em- 
pir ical libraries. The so lid lines are the solar metallicity isochrones 
by iGirardi et all ll200Cli ) for four different ages: black is lOMyrs, 
red is lOOMyrs, blue is IGyrs and red is 10 Gyrs. 
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lected all t he Lick/IDS indices as define d bv lWorthev et al] 
l| 19941 ) and IWorthev fc Ottavianl l|l997f ). We extended the 
wave length coverage adding th e index D4 000 fBalo gh et alJ 
1 19991 ). some indices de fined in iS erven et al . (2005) and the 
near infrared indices bv lDiaz et al. ( 19891 ). The indices cover 
the wavelength region ~ 3500A to ~ 8700A and are sum- 
marised in the Appendix (Table Al). All the indices ex- 
cept D4000 are defined by a central bandpass bracketed by 
two pseudo-continua bandpasses, which are used to draw a 
straight line to determine the continuum level at the feature. 
Atomic features are expressed in angstroms and molecular 
features in magnitudes. D4000 is defined using 100 A con- 
tinuum bandpasses to measure the break (3850 - 3950 A and 
4000 - 4100 A). 

We compared each synthetic library to each empirical 
library. For each empirical library, we selected all stars with 
metallicity in the range -0.1 ^ [Fe/H] ^ 0.1. For each star, 
the closest model in Teff and log g ([Fe/H] = 0.0) was se- 
lected in each of the synthetic libraries. The closest model 
was chosen based on the smaller distance (d) to the Tcff 
X log g plane, defined in equation [1] where Teg and log g 
are parameters of the models, and Tabs and (log g)o6s are 
parameters of the empirical libraries. 



/ / Jeff — Tots \ ' 



log g - {log g)obs 
{log g)obs 



(1) 



The typical parameter spacing of the models (250K in 
Tefi and 0.5 dex in log g) is of the same order of the ac- 
curacy of the atmospheric parameters in the empirical li- 
braries. Therefore, we believe the closest model is a reason- 
able approach. The theoretical libraries were degraded to 
the resolution of each empirical library prior to the mea- 
surements of the indices. The exception was the ELODIE 
library, whose superior resolution could only be matched 
by Coelho library. In this case the theoretical libraries and 
ELODIE were degraded to a common resolution of FWHM 
= 0.3A. 

Figures for all the comparisons are presented in the Ap- 
pendix (on-line material) . Figures 2] to [TO] show the results 
for some of the indices. The data points on the figures are 
the median values for each Tefr and log g bin in the empirical 
libraries, and the error bars are the correspondent one sigma 
dispersion of the empirical measurements for that parame- 
ter bin. A point with no error bar implies that there was 
only one star for that Tcft and log g bin. We colour coded 
the stars in three Toff intervals: blue squares are stars with 
Teft > 7000K, green diamonds are stars with 4500K < Tcff 
^5 7000K, and red asterisks are stars with Tcff ^ 4500K. 
The black crosses are stars with Tofr < 3500 K, but they 
are really rare. We also separated them by gravity: dwarf 
stars (log g ^ 3.0) are represented by filled symbols and 
giant stars (log g < 3.0) are represented by open symbols. 
The black line in each plot shows the one to one relation. 
The thick black symbols indicate the location of a Sun-like 
dwarf (cross; Tcfr = 5750K and log g = 4.5), and a typical 
Kl giant (diamond; T^ff = 4250K and log g = 1.5). The Kl 
giant have all parameters but metallicity close to the star 
Arcturus. We show the position of these particular stars on 
the plots because line lists are usually calibrated based on 
their high resolution spectra. Also shown in each plot is the 
adev value for each temperature range, a statistical mea- 



surement of how much each model is representing the stars 
in that range. Adev takes into account the distance of each 
theoretical point from the one-to-one line in the index plots, 
and is defined as: 



adev = 



-y 

N ^ 



(It 



(2) 



where N is the number of stars, It is the measure of the 
index on the theoretical library and le is the measure of the 
index on the empirical library. 

First thing to notice in these plots is that the error bars 
are non-negligible, specially for the low temperature stars. 
This is a consequence of the large uncertainties in the at- 
mospheric parameters of these stars. The determination of 
those parameters in cool stars is known to be a real chal- 
lenge. For the high temperature stars it is clear that the 
spread between each point is very small for most of the 
indices. This is somewhat expected, since there are fewer 
metallic lines as you go up in temperature, and therefore 
many of these indices will give essentially no information in 
this case. 

We organised the analysis grouping the results in four 
categories, related to the chemical species that dominate the 
index. It is worth remember tha t no index is sensib le to only 
one element (see e.g. tables at IServen et al.ll2005h . but we 
attempted to categorise the indices by its main element. 

Balmer lines: Include the indices H/3, H7A and H5yi. 
In general the hydrogen indices are well reproduced by all 
models down to 4500K. For the very low temperature stars, 
models start to deviate from observational libraries, clearly 
subestimating the indices, as shown in Figure|4]for H7A. It is 
known that Hydrogen lines computed in LTE match well the 
wings, but cannot reproduce the core of the fines. Fine tuned 
micro turbulence velocities or mixing length to pressure scale 
height ratio f/Hp were suggested in literature to improve 
the match in the solar spectrum (e.g. iFuhrmann et al.lll993l ; 
Ivan't Veer-Menneret fc Megessierl Il99d ). but the same pa- 
rameters would not necessarily improve the results for other 
spectral types. A realistic match would require NLTE com- 
putations of H lines, only available for very hot stars. Be- 
sides, the bottom of the hydrogen lines form in the chromo- 
sphere, not included in the model atmospheres grids. An- 
other point to note is that although theses indices are aimed 
at measuring H lines, in low temperature stars the actual 
hydrogen lines are considerably weak, and the metallic lines 
can be dominating the index. In this case, it is not clear if 
the main reason why the models are failing in reproducing 
the observed values is because of the non-satisfactory line 
core modelling, or because the dominance of uncalibrated 
metallic lines. 

C and N indices: Include the indices CN03862 , CNl, 
CN2 and G4300. According to'Tripicc o fc Belli (|l995l ) calcu- 
lations, the indices Ca4227 and Fe4668 are also highly sen- 
sitive to Carbon abundance variations, and therefore these 
two indices could be possibly included in this group. ^From 
these indices, the sub-sample that is sensitive to both C 
and N abundances (CN03862, CNl, CN2) show significant 
larger error bars, but the overall behaviour seem to be well 
matched by the models. Figure O that shows the CN2 index, 
illustrates this effect. On the other hand, indices that are 
mainly sensitive to C abundance variations (G4300, Ca4227 
and Fe4668) systematically deviate from the one to one line 
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Figure 4. Comparison of the index }i-fA measured in the empirical and theoretical libraries. Different symbols and colours represent 
three intervals of temperature: blue squares are stars with Teff < 7000K, green diamonds are stars with 4500K < Teff ^ 7000K and red 
circles are stars with Teff ^ 4500K. Filled and open symbols represent dwarfs (log g ^ 3.0) and giant stars (log g < 3.0), respectively. 
The black crosses are stars with Teff < 3500 K. The solid line is the one to one relation. The thick black symbols represent a Sun-like 
dwarf (cross) and an Arcturus-like giant (diamond). 



for stars cooler than T^g = 4500K. Figure E] shows the G4300 
index, which measures the G-band of CH at 4300A. One pos- 
sible reason for this effect is that the C and N abundances 
relative to Fe were assumed to be solar for all synthetic stars, 
but it is well known that the CNO-cycle lowers the C abun- 
dance and enhances the N abundance in giants (e.g. Ilbeni 
Ll967; Charbonncl 1994). The same effect on the indices CNi 
and CN2 would not be so clearly seen if the variations of C 
and N somewhat compensate each other. Nevertheless, we 
could not clearly attribute all the differences in these indices 
to the un-modelled CNO mixing. If the modelling of the 
CNO cycle was to be the only or major problem affecting the 
cool giants, we would expect the dwarfs (filled symbols; see 
e.g. Figure |6]) to be a closer match to the observations than 
the giants (open symbols). This is not the case, both present- 
ing similar patterns. Interestingly, for temperatures between 
4500K and 7000K, Coelho models reproduces considerably 
better the observations, while the cool end deviates more 
strongly than the other synthetic libraries. This is probably 
because the CH lines adopte d by Coelho models wer e com- 
puted with LIFBASE code (iLuque fc CroslevI \l99^ while 
Martins and Munari models adopt Kuruc j i 19931 ) molecular 
lines. This is a first indicative of how working on the line 
lists might impact the model results in significant ways. 

Iron peak elements: Many of the iron indices are good 
examples suggesting that working on the line lists might 
improve the model results significantly. Figure [7] shows the 
behaviour of the index Fe4383, where this effect is evident. 
Martins and Munari models have similar line lists, modified 



from the same iKuruc i (|1993| ) original work, while Coelho 
models employed its independent line list, based on high 
resolution stellar spectroscopy studies. The effect of the dif- 
ferent line lists is clearly seen. 

a elements: Include all the indices sensitive to Mg, Ca, 
Ti and O. In this case there is not a general pattern. Figure[S] 
shows the Mg2 index where the line list from Coelho repro- 
duces significantly better the observed values, specially in 
the low temperature regime. But it is interesting to point out 
that for stars cooler than T^s 4250K, this index is heav- 
ily contamina ted by TiO a molecular features (see Figure 
13 in . Coelho et al.ll2005l ). The Calcium and TiO indices, on 
the other side, are examples of how things can be complex. 
Figure[9]shows the index Ca4455. Coelho models tend to pre- 
dict slightly lower values than the observed. Munari models 
seem to show the same trend, to a lower level. At first order 
we could conc lude that both mod els are under-predicting 
this index, but iBensbv et al] (I2005D studied F and G dwarfs 
from the thin and thick disc of our galaxy and found that 
the [Ca/Fe] tend to be slightly super-solar for stars with 
[Fe/H] solar. In the likely case that the stars in the empiri- 
cal libraries show a similar behaviour than the one found by 
iBensbv et all l|2005l ). we should not expect the models, cal- 
culated with solar mixture ([Ca/Fe]=0), to actually match 
the observations. In this case, the behaviour of both Coelho 
and Munari models are consistent with the observations. 
Martins models show a more complex behaviour: interme- 
diate temperature stars, which were computed with SPEC- 
TRUM synthesis code and line lists and ATLAS9 models. 
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Figure 5. Comparison of the index CN2 measured in the empirical and theoretical libraries. This index measures the strength of the 
CNA4150 absorption band, in magnitudes. Symbols and colours are the same as in Figure |4] 
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Figure 6. Comparison of the index G4300 measured in the empirical and theoretical libraries. Symbols and colours are the same as in 
Figure |4] 
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Figure 7. Comparison of the index FeA4383 measured in tlie empirical and theoretical libraries. Symbols and colours are the same as 
in Figure |4] 



are overestimated; low temperature stars, calculated with 
PHOENIX models and line lists, are underestimated. Fig- 
ure[lO]shows the Ti02 index. This index has no meaning for 
stars with temperatures higher than 4500K, where there is 
no TiO in the spectrum to be measured. For lower tempera- 
ture stars the values raises rapidly, being extremely sensitive 
to temperature. This implies that uncertainties in the Toff 
adopted for stars in the empirical libraries (usually consider- 
ably higher for low temperature stars) make the comparison 
with models hardly reliable. Given the large uncertainties, 
models are not failing completely to reproduce this index. 



4.3 Dependence on the atmospheric parameters 

It is worth to keep in mind that errors on the empirical 
libraries, the most important one being uncertainties in the 
atmospheric parameters, hamper the comparison with the 
models. 

ELODIE library is the only of the empirical libraries 
that provides, for each star, a flag that indicates the accu- 
racy of each atmospheric parameter. In order to evaluate 
how much the accuracy might affect our comparisons. Fig- 
ures [TT] and [T2] show the same comparisons as before for the 
indices G4300 and Fe4531, but filtering the observed stars by 
the quality flag of the atmospheric parameters. On the first 
line of the figures all stars are plotted. On the second line, 
only stars with good and excellent flags for the atmospheric 
parameters. On the third line, only the ones with excellent 
determination. It is clearly noticeable how much the agree- 
ment between models and observations can change, based 
only on stars with very good parameter determinations. The 



Martins ct oL Munari ct al. Coelho et al 



■ All 


- All - 


^^^^^^^^^^^^^^^ 


■ Good and 


- Good and 
ExceHet^^ 


■ Good and 
Excel^^ 


■ Excellent 


- Excellent - 


■ Excellent 



-4-2 2 4 6 B 10-4 -202468 10-4 -2 2 4 6 8 10 

Observer^ 

Figure 11. Comparison of the index G4300 measured on the 
ELODIE library, filtering by the accuracy flags. First line has all 
the stars, second line shows only stars with good and excellent at- 
mospheric parameters, and the third line only stars with excellent 
flags. Symbols and colours are the same as in Figure |4] 



drawback, on the other hand, is that this filter limits dras- 
tically the number of points. 
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Figure 8. Comparison of the index Mg2 measured in the empirical and theoretical libraries. Symbols and colours are the same as in 
Figure |4] 
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Figure 9. Comparison of the index Ca4455 measured in the empirical and theoretical libraries. Symbols and colours are the same as in 
Figure |4] 
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Figure 10. Comparison of the index Ti02 measured in the empirical and theoretical libraries. Symbols and colours are the same as in 
Figure |4] 
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Figure 12. The same as Figure [TT] for the index Fe4531. 



4.4 Dependence on the flux calibration 

A second issue that can complicate the comparison between 
model and observations are related to flux calibrations un- 
certainties. One of the advantages of using spectral indices is 
that they were designed to be, as much as possible, insensi- 
tive to flux calibration issues. That implies that when using 
these indices to study the properties of stellar populations, 
the continuum shape is not used to extract information from 



the spectra. This is particularly useful when it is not possible 
to accurately flux calibrate the observations. 

In order to test how sensitive to flux calibration issues 
are the indices studied here, we employed a modified ver- 
sion of Coelho library. As explained in §2, a library focused 
on spectroscopic use is not suitable to predict broad-band 
colours because it does not generally include the full line 
blanketing. As the libraries stand now, our note to the stel- 
lar population modeller — which might be interested in using 
any of the synthetic libraries currently available — is that 
one has to find a compromise between a library which is good 
for spectrophotometric predictions or one which is good for 
spectroscopic studies. Until the accuracy of the predicted 
energ y levels lines is signiflcantly improved (see e.g. iKurucd 
l2006l l. the only way of achieving reasonable predictions for 
both broad-band colours and high resolution indices is by 
correcting the pseudo-continuum of current high resolution 
libraries to better match observed colours. In order to use 
the high resolution libr ary to build stellar population mod- 
els, ICoelho et al.l (j2007l) applies a corre ction to the original 
library presented in ICoelho et all (|2005h in order to compen- 
sate for the mentioned missing line opacity. In a few words, 
this correction was done by comparing each star in Coelho 
library to the correspondent flux distributions by ATLAS9 
grid. Smooth corrections to the continuum shape were ap- 
plied to the stars in Coelho library in order to better match 
the continuum shape of its correspondent flux distribution 
by ATLAS9. Therefore, the modifled Coelho library kept the 
high resolution features of the original library, but presents 
a flux distribution which is closer to that predicted when 
including all blanketing (ATLAS9). 
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Figure 13. Comparison between the colours predictions from 
two versions of Coelho library, with and without the empirical 
correction of the continuum as described in §4.4 (blue squares and 
green triangles respectively) . Red diamonds are the predictions by 
ATLAS9 models, for comparison. 



The effect of this correction is shown in Figure 1131 in a 
similar fashion of the broad-band colours figures at §3. AT- 
LAS9 flux distributions are shown as red diamonds, Coelho 
original library stars are shown as green triangles, and the 
blue squares arc the flux corrected stars (the modified Coelho 
library by Coelho et al. 2007) .The effect of the missing line 
opacity is clear, specially in the blue part of the spectrum. 

The spectral indices were then measured in the modified 
Coelho library and compared to the original measurements. 
These comparisons can show how smooth changes in the 
stellar pseudo-continuum can affect the measurement of the 
indices used in the present work. As expected, for most of 
the indices the differences between the two measurements 
are smaller than 3%. Among the classical Lick indices, only 
Ca4455 and Mgi are slightly more sensitive (~ 5%). 



Figure 14. Comparison between indices calculated for two ver- 
sions of Coelho library, with and without the flux correction due 
to missing line opacity. 



The notable exceptions are the indices D4000 and the 
three Ca indices in the near infrared, that showed a consider- 
able sensitivity to the modifications of the continuum shape 
(reaching above 10% in the most extreme cases). In Figure 
1141 we show the comparisons between the indices calculated 
with the original library {x axis) and the flux corrected one 
{y axis), and the residuals in the bottom panels. This high 
sensitivity of D4000 index to flux calibrations issues has also 
been noticed by G. Bruzual, V. Wild & S. Chariot (priv. 
comm.) 



4.5 The profile of the H lines in high temperature 
stars 

Balmer lines play a crucial role in the quantitative spec- 
tral analysis of hot stars. The Stark broadened wings de- 
pend on the photospheric electron density and, consequently, 
the stellar gravity log g. The line cores on the other hand 
are more sensitive to the effective temperature Tgff. Thus, 
the complete Balmer line profiles contain information about 
both fundamental atmospheric parameters, Teff and log g. 
The effects of NLTE were demonstrated to be of drastic 
importance since the pioneering work of lAuer fc Mihalaj 
(| 19721 ) ■ and have to be conside red in order to reproduce 
these lines. iMartins et all (|2005| ) already showed that this 
effect becomes more important with increasing T^ff, making 
a real difference for O and early B stars. 

Figure [15] shows a comparison between three hot stars 
from the ELODIE library (which is more complete for hot 
stars) and the theoretical libraries from Martins and Munari 
{Coelho library stops at 7000K). The hot stars in Munan 
library are also limited to log g equal to 4.5 or 5.0, while in 
the empirical libraries the hotter stars have 3.5 ^ log g ^ 4.0. 
The top line of the figure shows three Balmer lines for a star 
with Tcff ~21000K. In this case, both models are LTE. On 
the II/3 profile this might be the reason for not reproducing 
the very bottom of the line. The middle and bottom lines 
show two hotter stars (spectral type O), only represented in 
Martins library. For this temperature range Martins library 
consider NLTE computations, and all Balmer profiles are 
very well reproduced. 
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Figure 15. Comparison between models and observations for 
three of tlie balmer line profiles. Observations are shown in black, 
and models are given in blue (Martins) and red (Munari). The 
star identification and stellar parameters are shown in the plots. 



4.6 Summary 

The overall performance of the high resolution synthetic li- 
braries is summarised in Figure [TBI This figure shows the 
variation of adev for each theoretical library, split in the 
three Teff intervals. We did not considered observed stars 
that were significantly deviating from the other stars with 
similar Toft and log g. For each theoretical library and each 
index, the adev shown is the average of the adev values ob- 
tained by the comparison to the three empirical libraries 
(the results for each of the empirical libraries are given in 
the Appendix). 

The indices are shown on the x axis, in order of increas- 
ing wavelength. The dotted lines are linear fits of the adev 
values for each of the synthetic libraries (this fit does not 
take into account the near-IR indexes, since the only em- 
pirical library that covers this region is Indo-US). Although 
this figure cannot be seen as a precise measure of the quality 
of the models, it can highlight interesting patterns. 

First, all models are systematically deviating more in 
the blue part of the spectrum, where the blending of lines is 
considerably larger. To improve the quality of the line list, 
specially in the blue region and further in the UV is the 
aim of t he HST Treasury Program 9455 by Ruth Peterson 



aim 01 t ne tig I ireasury t^rogram ij4oo by Kutn t^eterson 
(see e.g. [Peterson et"al]|200ll . T20o3 ). and we confirm here 
that this is clearly the part of the spectrum that needs more 
work. 

Second, Coelho library is the one that has the best av- 
erage performance. This is likely a consequence of their line 
list, which was calibrated a long the years in several high res- 
olution stellar studi e s (e.g. Erdelvi-Mendes fc Barbuvll 19891 : 



ICastilho et al ] |l999l : iMelindez et al.ll2003l l . For stars hotter 
than 7000K Martins and Munari have similar results, but 
again, these indices are very weak and provide almost no 
information on this hot stars. A visual comparison of the 
Balmer lines profiles shows, nevertheless, that above Tefi ~ 
30000K, NLTE modelling is crucial. 



The values of adev are tabulated in the Tables in the 
Appendix. 



5 CONCLUSIONS 

With this work we aimed at pointing strengths and weak- 
nesses of current theoretical stellar libraries, focusing on the 
observable values that are mostly used in stellar population 
models. 

We divided our comparisons in two parts. In the 
first part, presented in §3, we measured broad-band 
colours predicted by three of th e most used mode l atmo - 
spheres grids cur r ently avail able: Castelli fc K uruc3 jgOOSh , 
iGustafsson et al.l l|2003D and lBrott fc Hauschi ldt (2003) . We 
compared the model prediction s with the recen t empirical 
colour-temperature relation by IWorthev fc Le3 (.2007), for 
the stars that are representative of a young and a old simple 
stellar population. The empirical relation is fairly well repro- 
duced by the models for the colours V-I, V-R and J-H. All 
models are a little too blue in the B-V and H-K plane. The 
biggest differences among the models, and also where they 
most deviate from the empirical relation, is seen in the U-B 
colour. ATLAS9 is the model grid that best represents the 
empirical relation (although a considerable improvement is 
still required), but presents slightly higher residuals in the 
visual bands. All colours of the cooler stars (Tch < 4500K) 
also need improvements 

The second part of our comparisons, presented in §4, fo- 
cus on the high resolution synthetic libraries that are aimed 
at spectroscopic studies. We measured thirty five spectral 
indices defined in the li terature on th r ee rec e nt high resolu- 
tion s ynthe tic libraries: Coelho et al.l (|2005l ) , [Martins et al.l 
(|2005[ ) and [Munari et al.l (|2005[ 'l. We compared the model 
indices with the observed measure ments given b y thre e 
high quality empirical l ibrari es by Valdes et al.[ ([2004h. 
[Sanchez-Blazauez et al.[ (|2006l ) and Prugniel fc SoubiranI 
(|200l| ). Our first result is that it is not trivial to compare 
model and empirical libraries because errors in the atmo- 
spheric parameters of the observed stars, and the partic- 
ular abundance pattern of the solar neighbourhood might 
mislead into wrong conclusions about the accuracy of the 
models. Overall, we found that models are systematically 
worse in the same two regions that (not surprisingly) need 
improvement in the model atmospheres grids: the very cool 
stars and the blue region of the spectra. In the very low tem- 
perature regime molecular and metallic lines dominate the 
spectra and models are clearly struggling to reproduce them. 
A special care in the blue part of the spectrum is needed. It 
is worth highlighting the effect of differ ent choices of atomic 
and molecular line lists. The library bv [Coelho et al.l (|2005[ ) 
employs a line list that have been refined along the years in 
high resolution stellar spectroscopic studies, and the effect 
is seen in its better average performance, specially in the 
indices of iron peak elements. T he indices are hardly use- 
ful for evalua t ing ho t stars. Only [Martins et al.l l|2005h and 
[Munari et al.l l|2005h libraries provide stars at the tempera- 
tures requi red for the modelling of young populations, and 
even then, [Munari et all ([2OO5I ) is limited to log g ^ 4.5, 
which were not present in the empirical libraries. A visual 
comparison of the Balmer lines profil es shows, as in previo us 
studies, that NLTE modelling (as in [Martins et~aill2005[ ') is 



16 Martins & Coelho 



Teff ^ 4b00K 
+ Mortins et ol. 
)K Munori et ol. 
Coelho et ol. 




4500 < Teff ^ 700C 
-|- Mortins et ol. 
)K Munori et ol. 
O Coelho et ol. 




■if s;: CM 
CD I cn 
in Q r- CO 
cj m m 

■< != O 

u ^ g 


gin 

O <o ^ 

I o 

o 


CM r-. ■if o < 1^ 
1 r-- CM <D o 1 CD 

^ ^ CM CM ^^ p- fO 

o ^ ^ -3- -if 3: ^ 

° -3 6 " -S 


O'-co^iri — rM-CjOLniocncM 
inroiDX-- c7'C7'c:nr-.Kiooco 
^iDin oss^cNm-tf-r- 
-3-^-=^ ininminin 


Q — CNOCOCICMr-- 

oOOcncn-ii-(DO 
:zrzp^"ififi(Oco 
CO CO CO 00 CO 

2 'Si 

tj o c5 ^ 










Teff > 7000 
-1- Mortins et ol. _ 
)^ Munori et ol. 























n ^ fn o o 
LT) In S In 



CO CO m (O 



Figure 16. Average values of adev for each index and each theoretical library. The panels show three intervals of temperature, labelled 
in the plot. Each point is the average adev given by the comparison with the three empirical libraries. Black crosses, red stars and blue 
diamonds represent the values for Martins, Munari and Coelho libraries respectively. 



required for the good reproduction of the Balmer line pro- 
files of very hot stars. More tests are needed in this sense, 
since these hot stars can have very different rotational veloc- 
ities which might affect the line profiles. Comparing models 
with slow rotation stars (which would not smooth the line 
profiles) would be the best way to test how much models are 
indeed able to reproduce these lines. 

As suggestions to the next generation of theoretical 
stellar libraries, we think that concerning the model atmo- 
spheres grids and fiux distributions, a better reproduction of 
the blue fiux (particularly important in the studies of young 
populations or any population at high redshift), and of the 
cool giants (that dominates the integrated spectra of old 
populations) are still required. It is unclear to the present 
authors (none of us being a stellar atmosphere modeller) if 
this is to be achieved by more sophisticated physics mod- 
elling (3D computations, NLTE, non-spherical effects, etc.), 
or improvements in the molecular, atomic and continuum 
opacities, or yet by adjusting model parameters (like mass, 
mixing-length parameter and micro-turbulent velocities) as 
a function of spectral type. Concerning the high resolution 



libraries, we believe that significant improvement can still 
be made by fine tuning the atomic and molecular line lists, 
through their calibration against high resolution stellar spec- 
tra whose atmospheric parameters — Toff , log g and detailed 
abundance ratios — are known very accurately. 
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